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Abstract 

The first output-sensitive algorithm for the Maximal Clique Listing problem was given 
by Tsukiyama et al. in 1977. As any algorithm falling within the Reverse Search paradigm, 
it performs a DFS visit of a directed tree (the RS-tree) having the objects to be listed (i.e., 
maximal cliques) as its nodes. In a recursive implementation, the RS-tree corresponds to the 
recursion tree of the algorithm. The time delay is given by the cost of generating the next 
child of a node, and Tsukiyama showed it is 0{mn). In 2004, Makino and Uno sharpened 
the time delay to 0(n‘^) by generating all the children of a node in one single shot, which 
is performed by computing a square fast matrix multiplication. In this paper, we further 
improve the asymptotics for the exploration of the same RS-tree by grouping the offsprings’ 
computation even further. Our idea is to rely on rectangular fast matrix multiplication in 
order to compute all the children of nodes in one single shot. According to the current 
upper bounds on square and rectangular fast matrix multiplication, with this the time delay 
improves from to 

Keywords: Maximal Clique Listing, Rectangular Fast Matrix Multiplication, Output Sensitive 
Algorithm, Polynomial Time Delay, Reverse Search Enumeration, Backtracking. 


1 Introduction 


In an undirected graph G, a clique is any subset K of the vertex set such that any two vertices 
in K are adjacent. A clique is maximal when it is not a subset of any larger clique. This 
paper addresses the problem of generating all the maximal cliques of a given graph, namely 
Maximal Clique Listing (MCL). Maximal cliques are fundamental graph objects, so the MCL 
problem may be regarded as one of the central problems in the field of graph enumeration, and 
indeed it attracted a considerable attention also in the past 11 liii- The problem has not only 


theoretical interest in computational complexity, but it possesses several consolidated applications 
as well, e.g., in bioinformatics, clustering, computational linguistics and data-mining SBi- 

As shown by Moon and Moser in 1965, any graph on n vertices contains at most 3"/^ maximal 
cliques [l^ . It is therefore particularly interesting to focus on polynomial time delay algorithms for 
generating all of them without repetitions. An MCL algorithm has 0{f{n)) time delay whenever 
the time spent between the outputting of any two consecutive maximal cliques is 0(/(n)); for 
this, the procedure is allowed to undertake a polynomial time pre-processing phase, if needed. 

Both in the past and more recently, a considerable number of algorithms have been presented 
and evaluated (experimentally or theoretically) for MCL. Tsukiyama, et al. [H first proposed 
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in 1977 a polynomial time delay solution for generating all maximal independent sets (thus, by 
complementarity, all maximal cliques) in a given graph G = {V,E). Their procedure works 
with 0{n + m) space and 0{mn) time delay. Here, m = \E\ and n = \V\. In 1985, Chiba 
and Nishizeki reduced the time delay to 0{'j{G)m), where 7 (G) is the arboricity of G and 
m/{n — 1) < 7 (G) < Johnson, Yannakakis and Papadimitriou Q proposed in 1988 an 

algorithm for enumerating all the maximal cliques in the lexicographical order. Their procedure 
runs with 0{mn) time delay, but it also uses 0{nN) space (where N denotes the total number 
of maximal cliques of G). A summary of previous and present results is offered in Table [T] 

Table 1: Time and Space Complexity of the main Output-Sensitive Algorithms for MCL. 


Algorithm 

Time to First x 

Time Delay 

Work Space 

Algo, m 


0(n2-(i44/2)-2) 

q(j^4.2796) 

Algo. [S] 

0(n‘"+3+xn2“(Fi.i/2)-2) 

0 (n‘^+^) 

0(n4) 

MU04 [9J 

0{xn‘^) 

0 (n“) 

O(n^) 

CN85 

0{xa{G)m) 

0{a{G)m) 

0{m + n) 

TIAS77 [11] 

O(xmn) 

Olrnn) 

0{m + n) 

JYP 88 [^ 

0{xmn) 

0{mn) 

0(mnN) 


Both the algorithm of Tsukiyama, et al. and that of Johnson, et al. can be placed within the 
framework of the Reverse Search Enumeration (RSE), a technique which was first introduced by 
Avis and Fukuda in the context of efficient enumeration of vertices of polyhedra and arrangements 
of hyperplanes [l|. Very briefly, the RSE is a technique for listing combinatorial objects by 
reversing a given optimization objective function /. Let G = (V, E) be a connected graph whose 
nodes are precisely the objects to be listed. Suppose we have some objective function f : V ^ N 
to be maximized over all nodes of G. Also, assume we are given a local search algorithm on G 
that is a deterministic procedure to move from any node to some neighboring node which is larger 
with respect to /, until there exists no better neighbor. The algorithm is finite if for any starting 
node it terminates within a finite number of steps. We may consider the digraph 7 g with the 
same node set as G and in which the edges are all the ordered pairs {x,x') of consecutive nodes 
X and x' generated by the same local search algorithm. Assuming that there is only one local 
optimal node x*, then 7 g is a single directed tree spanning all the nodes of G and having x* as 
its only sink and root. In this manner, if we trace Eg from x* backwards, say with a Depth-First 
Search, we can enumerate all nodes of G, i.e., all combinatorial objects. The major operation 
involved is tracing each edge against its orientation, which corresponds to reversing the local 
search optimization algorithm in order to compute a parent-child relation that fully describes Eg] 
notice that, in this case, the minor work of backtracking is simply that of performing a single 
local search step itself. Whence, the key ingredient of any RSE is the computation of the parent- 
child relation in an efficient way. If the height of Eg is at most n, then the memory consumed 
throughout the listing process is polynomial in n. 

Indeed, the algorithm of Tsukiyama, et al. performs a DFS visit of a directed tree - namely, 
the RS-tree - having the objects to be listed (i.e., maximal cliques) as its nodes. In a recursive 
implementation, the RS-tree corresponds to the recursion tree of the algorithm. Tsukiyama, et 
al. showed in [ll| that the time delay of their procedure is 0{mn). In 2004, Makino and Uno 
sharpened the time delay of MCL to by generating all the children of a node in one single 

shot which is performed by computing a square fast matrix multiplication [9|. In particular, the 
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procedure of Makino and Uno runs with 0{M{n)) = 0(n‘^) time delay and works with 0{ri?) 
space, where M{n) = 0{n^) denotes the minimum number of arithmetic operations needed to 
multiply two n x n square matrices. The best upper bound on ui which is currently known was 
shown by Le Gall Q in 2014, it is a; < 2.3728639. By these results, the algorithm of Makino and 
Uno runs with 0 ( 7 ^ 2 . 3728639 ^ time delay. To the best of our knowledge, this is the tightest upper 
bound on the time delay complexity of MCL which is currently known in the literature. 

Contribution. In this work we improve the tightest known upper bound on the time delay 
complexity of MCL. In particular, we show that the parent-child relation of the corresponding 
RS-Tree admits an asymptotically faster (with respect to that devised by Makino and Uno M) 
computing procedure that works by grouping the offsprings’ computation even further than in [9|. 
Briefly, our procedure works by grouping together multiple children generation problems into 
batches of problems (where each single problem consists into the task of computing all children 
of a given maximal clique) and then by reducing the job of solving a whole batch of problems, in 
one single shot, to that of multiplying two rectangular matrices. We remark that, in so doing, this 
work proposes a novel representation for the basic task of generating the children nodes for MCL. 
This conceptual shift is the essential lever which allows for a suitable adoption of rectangular 
matrix multiplication methods in MCL. In this way, we prove a sharpened upper bound on the 
time delay of MCL, improving it from 0{n^) = ( 9 ( 77 , 2 . 3728639 ^ _ ^ 1 -^.^ 2 . 093362 ^. 

here, denotes the minimum number of arithmetic operations needed to perform 

any x n hy n x matrix product, and it is the standar 4 il notation for expressing significant 
bounds on rectangular matrix multiplication. Our main results are summarized below. 

Theorem 1. There is a procedure (Algorithm\^ for listing all the maximal cliques of any given 
n-vertex graph G = {V,E), without repetitions, and in such a way that for every a:: G N the first 
X maximal cliques are outputted within the following time bound: 

Tfirsts = 0(n“+3 + 

For this, the procedure employs 0{n‘^) space. 

Theorem 2. There is a procedure (Algorithm\^ for listing all the maximal cliques of any given 
n-vertex graph G = {V,E), without repetitions, and with the following time delay: 

Uelay = = O {n^ , 

which is a worst-case upper bound on the time spent between the outputting of any two consecutive 
maximal cliques. For this, the procedure firstly performs a bootstrapping phase, whose worst-case 
time complexity is bounded as follows: 

Tioot = 0(n‘"+3) = 

Moreover, the procedure employs ( 9 (n‘^“^‘^(^ 4 .i/ 2 )-i- 6 ^ _ q|-j.j4.2796^ space. 

In passing, we shall introduce a backtracking technique, named Batch Depth-First Search 
(Batch-DFS), whose aim is to keep the search of maximal cliques going on, solving one batch of 
problems after another, consuming only polynomial space overall. An in-depth time and space 
analysis of Batch-DFS is offered, as we believe that it may be of general interest for applying a 
similar approach to some other listing problems that admit polynomial time delay algorithms. 

^Following the notation as in [3, for any k GQ such that k > 0, let C(n, be the minimum number of 

arithmetic operations needed to multiply an n x matrix by an x n one. The corresponding complexity 

exponent is defined as follows: 

a;(l, fc, 1) = inffr E M | C(n, , n) = 0{n'^)}, for every (0, H-oo) D Q. 

Notice that a;(l, 1,1) = a; is the complexity exponent of the n x n square case. As for x \n^] by [n^\ x [n^\ 
matrix products, the corresponding complexity exponent is u(i, k,j) = inffr E M | C{\rL^\^ = ^( 71 ”^)}. 


3 



6 



Figure 1: An example graph obtained by gluing together the complete graphs /C 5 and /C 3 . The 
corresponding maximal cliques are Kq = {1,2,3,4, 5}, Ki = {1,6}, K 2 = {2,7}, = {5,8}, 

Ki = { 6 , 7, 8 }. Here, 1<2<3<4<5<6<7<8 and ATq >iex Ki >iex K 2 >iex >iex K^. 

Organization. The rest of this paper is organized as follows. In Sectior(2l some background 
notation is introduced in order to support the subsequent sections. Section [3] recalls some major 
aspects of Tsukiyama et ai, Johnson et al., and Makino Uno’s solutions; in particular, the 
construction of the RS-tree 7 g is recalled and revised, as this is actually the enumeration tree 
of all the maximal cliques that we aim to list. In Section |T1 we describe our reduction from the 
problem of computing all children of any batch of nodes of Tg to that of performing rectangular 
matrix products. The Batch-DFS backtracking is introduced and analyzed in Section [5] Our 
Maximal Clique Listing algorithms are offered in Section [ 6 l Finally, Section [7] closes the paper. 

2 Background and Notation 

To begin with, our graphs are undirected and simple, i.e., they have no self-loops nor parallel- 
edges. Let [n] = {I,..., n} for every n e N. Let G = {V, E) be a graph with vertex set V = [n] 
and edge set E = {ei,... ,em}- Here, \E\ = m and \V\ = n. Moreover, for any vertex subset 
S CV, let x{S) be the characteristic vector of S, i.e., for every i G [n] the i-th coordinate of x{S) 
is I if i G S', and it is 0 otherwise. For any vertex v gV ot G, let r(u) = {u G V \ {u,v} G E} 
be the neighbourhood of v, and let 5(y) = |F(?;)| be the degree of v. Let A = max^gy S(v) be the 
maximum degree of G. For any vertex subset S C V and any index i G [n], define S<i = S fl [f] 
and S<i = S n [i — 1] (where S<i = 0). For any two vertex sets X and Y, we say that X 
is lexicographically greater than Y, denoted X >iex Y, if the smallest vertex (i.e., the smallest 
natural number i) in the symmetric difference {X \ H) U (H \ X) is contained in X. The usual 
common ordering on N is denoted < (i.e., without the subscript lex). A clique is any subset 
K of the vertex set V such that any two vertices in K are adjacent. A clique is maximal when 
it is not a subset of any larger clique. For any clique K (not necessarily a maximal one), let 
lc(Ar) be the lexicographic completion of the clique K, namely, the lexicographically greatest 
among all the maximal cliques containing K. It is clear from its definition that lc(Ar) is not 
lexicographically smaller than K. To conclude this section, let Kq be the maximal clique which 
is the lexicographically greatest among all the maximal cliques of G. Notice ATo = lc(0). This 
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notation is exemplified in Fig. (TJ where a running example for this paper is proposed. 


3 The RS-Tree of Maximal Cliques 

This section recalls some major aspects of the previous algorithms for MCL, which were devised 
by Tsukiyama et al., Johnson et al.^ Makino and Uno, as these comprise the backstage and the 
backbone of our present solution. In particular, this section reworks the construction of the 
Reverse Search Tree (RS-tree) Tg for enumerating all the maximal cliques of any given graph G. 
This is done by studying the corresponding parent-child relations. In the original paper of Makino 
and Uno, all proofs about the characterization of 7 g were omitted due to space restrictions. In the 
present work full proofs are presented for the sake of completeness. Indeed, offering a simple and 
self-contained exposition of what in was one of our purposes. In cleaning out the arguments, 
and to help the understanding of the reader, we opted for restructuring also the statements and 
the network of their relations. To begin with, let us observe some introductory properties. 

Proposition 1. Let K and K' be two cliques of G = (V, E). If K C K' then lc{K) >iex lc{K'). 

Proof Notice that K C K' Q lc{K') and recall that lc{K) is the lexicographically greatest 
maximal clique containing K. □ 

We proceed by observing a simple characterization of lc(-). 

Proposition 2. Let K be a clique of G = (U, E). For any u G [n], precisely one of the following 
two must occur: 

1. V G lc{K); 

2. there exists zGiiru([u — I]n lc{K)) such that v ^T{z). 

Proof It is sufficient to show that v 0 lc{K) if and only if (2) holds on v. Since lc{K) is the 
lexicographically greatest maximal clique containing K, then v 0 lc{K) if and only if at least 
one of the following two conditions hold: either v is not adjact to all vertices in K (i.e., there 
exists z G K such that v 0 r(z)), or v is not adjacent to some z G lc(i^) which is smaller than v 
(i.e., there exists z G [u — 1] fl lc{K) such that v 0 r{z)). For this reason, v ^ lc{K) if and only 
if there exists z G iiT U ([u — I] fl lc(/f)) such that v ^ r(z). □ 

The next proposition shows that lc(-) is computable within 0{m) time. 

Proposition 3. Let K be a clique of any given graph G = (V,E), where \V\ = n and \E\ = m. 
The lexicographical completion lc{K) is computable within 0{min{m, A'^}) time. 

Proof. Consider Algorithm [TJ It takes as input a clique K of G. Moreover, it employs the 
subprocedure is-complete() in order to test, on input {u,X) for some u € V and X C V, 
whether {u,x} G E for every x G X. This check can be done in 0(S(u)) time. So, Algorithm [T] 
works as follows. Firstly, an auxiliary set S gets initialized as S' = AT at line 1. Soon after, a 
vertex v G E is picked up arbitrarily at line 2. In the rest of the algorithm the auxiliary set S 
will be incremented. The rationale, here, is that every node taking part to this augmentation 
must be among the neighbours of v. In fact, at line 5, Algorithm [T] augments S with vertex u 
if and only if u is the lexicographically greatest vertex (i.e., the smallest natural number) which 
lies in lc(Ar) \ S. At the end, S is returned at line 6. Let S be the set outputted by Algorithm 1. 
Notice that, for every v G [n], precisely one of the following conditions hold: either v G S or there 
exists z G AT U ([u — 1] n S) such that v ^ r(z). Thus, by Proposition!^ S = lc{K). Of course. 
Algorithm [T] halts within time 5{ui)) = 0(min{m, A^}). 
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Algorithm 1: Computing the Lexicographical Completion lc(-). 
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Procedure compute-C{K,G) 

Input: A clique K of G = {V,E). 

Output: The lexicographical maximal clique lc(A') containing K. 
S <— K\j / initialize the set S to K 
V •«— pick any vertex v £ K-, 

foreach u £ r({)) (in ascending order w.r.tN) do 
if u ^ K and is-complete(u, S) = true then 
|_ S <- SU{u}- 
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return S; 


Here above, the pseudocode of Algorithm [T] closes the proof. □ 

Given any n-vertex graph G = (V, E), for any maximal clique C Kq) there exists at least 
one index (i.e., one vertex) i G [n] such that lc(C'<i) ^ C. Indeed, lc(C<o) = Kq ^ C. In virtue 
of this fact it makes sense to define the parent of C as V{C) = lc(C<i), provided that i G [n] is 
the greatest index satisfying lc(C'<i) C. Such an index i is called the index ofC, and it is also 
denoted by i{C). As mentioned, these indices are well defined. Moreover, notice P(C') >iex G, 
i.e., the parent P(C') of any maximal clique G (y^ Kq) is not lexicographically smaller than G. 
This implies that the corresponding parent-child binary relation is acyclic and creates an in-tree, 
denoted Tg, which is directed towards its root Kq. We say that Tg is the RS-tree of G. Of course, 
the nodes of Tg corresponds to the maximal cliques of G that we aim to list. 

Fig. [2] depicts the RS-Tree Tg associated to the running example graph of Fig.[TJ Every node 
of Tg depicts a maximal clique of G and its corresponding index. 



Figure 2: The RS-Tree Tg corresponding to the running example graph G of Fig.[TJ 

At this point, we shall provide an effective algorithm for computing V{-). Indeed, it is not 
difficult to see that T{C) is computable from G in linear 0{m+n) time. Here below, Proposition|4] 
shows how to compute the index i(-), while Proposition [5] finally provides an 0{m + n) time 
algorithm for computing the parent relation V{-). 

Proposition 4. Let G be a maximal clique of G = {V,E), where \V\ = n and \E\ = m. 

The index i{G) is computable within 0(m + n) time. 

Proof. Consider Algorithm [21 it takes as input a maximal clique C of G and it aims to return 
the corresponding index i{G) as output. The procedure works as follows: at the beginning, 
each vertex u G P \ G is marked as active. Moreover, the procedure keeps track of a counter 
dc{v) : P —>■ N, which is initialized to be the degree of v with respect of G, for each v £ V. 
Then, for each u G P in descending ordering from n to I, Algorithm |2] checks whether “i(G) = v” 
in the following manner: I. if t; ^ G, then v becomes deactive at line 14; 2. otherwise t G G, 
then V is (roughly speaking) turned-off within G, and thus the counter of every u G r(r;) \ G is 
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decremented at line 7. The size of C is also decremented at line 8 . At this point, if there exists 
z & V \ C which is still active and such that dc{z) > size, then v is returned as output at 
line 12. The existence of such z can be checked quite efficiently as follows: at line 9 the procedure 
picks the greatest vertex (i.e., the greatest natural number) u G C such that u < v, say u, then, 
at line 10 and line 11 , the neighbourhood of u is inspected in order to check whether there is any 
z G r(it) \C which is still active and such that dciz) > size. Also notice that, if line 12 is never 
reached, the procedure returns T at line 15 (because the index of the root Kq is undefined). 

This concludes the description of Algorithm [2] 

Algorithm 2: Computing the Index i(-). 

Procedure compute-i{C) 

Input: a maximal clique C of G = {V,E). 

Output: the index i{C) of C. 

dc{v) •«— |{u G C I {«, a} G A}|; // i.e., the degree of u G C in C. 
size •<— \C\\ 

label each a G C \ C as active; 
foreach v G V, from n to 1 do 
if V G C then 

foreach u G r(a) \ C do 
|_ dciu) -f- dciu) - 1; 

size <— size — 1; 

u G- the greatest vertex u G C such that u < v; 
foreach z G r(tt) \ C do 

if z is active and dc{z) > size then 
return v; 

else 

label V as deactive; 

return T; 
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The correctness of the procedure follows easily from the definition of lexicographic completion 
lc(-) and that of index z(-). Concerning its time complexity, observe that the procedure visits 
each vertex and each edge at most 0(1) times, and the work done at each one of those is 0(1) 
as well, so that Algorithm [5] always halts within 0{m + n) time. □ 

Proposition 5. Let C be a maximal clique of G = (V,E), where \V\ = n and |i?| = m. 

The parent maximal clique 'P{C) of C is computable within 0{m + n) time. 

Proof. Firstly, compute the index i{C) with Algorithm (defined in Proposition 14]) . Secondly, 
compute 'P(O) = lc(Oi(c)) by invoking Algorithm [T] (defined in Propostion [S]) . □ 

As already mentioned, our algorithm, that of Tsukiyama et al. [Il[, Johnson et al. Q, as well 
as that of Makino and Uno Q, traverse the nodes of 7 g in a DFS-like fashion starting from the 
root Kq. However, in order to traverse Tg^ "we first need to show how to effectively characterize 
all the children C of any given maximal clique P ol G. The following is a simple but crucial 
observation. In order for P to be the parent of C, two reconstructability conditions should hold 
at the same time, namely: 

1. the parent P should be “reconstructible” from its child G; 

2. the child G should be “reconstructible” from its parent P and index i{G). 

On this way, the following fact turns out to play a twofold pivotal role. 
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Lemma 1 (Reconstructability Lemma). Let P and C he maximal cliques of G. Assume that 
P = V{C) and let i be the index of C. Then, C^i = P^i D r(z). 

Proof. • Firstly, we argue C'<i C P^^ n r(z). 

Since i is the index of C, then i G C. Thus, C<i C r(i). Moreover, P = V{C) = lc(C'<i) 
implies C'<i C 

• Secondly, we argue H r(i) C C<i. 

Let V be any node in P<inr(j). Notice v < i and recall that C = lc(C'<i). In order to show 
V S C'<i, it is thus sufficient to prove v G lc(C<i). For this, we shall rely on Proposition [51 
Now, observe the following two facts: 

1 . No z £ (7<i is such that v 0 r(z). In fact, v is adjacent to all vertices in C<i (because 
V G P = lc(C'<i)) and v is adjacent to i (because v G r(j)). 

2. No z G [u — I] n lc{C<i) is such that v ^ r(z). In fact, since P is a clique containing 
u, no z G [u — 1] \r(u) belongs to P, whence neither to C'<i (because C<i C P<i C P), 
nor to C<i (because v < i). This implies that no z G [u — 1] \r(u) belongs to lc(C<i), 
because lc{C<i) = C and v <i. 

At this point, v G lc(C<i) follows directly from Propositionj^ 

□ 

We are now in position to characterize the parent and children reconstructability conditions. 
Indeed, at this point, they both turn out to be a direct consequence of Lemma [1] 

Proposition 6 (Parent and Child Reconstructability). Let P and C be maximal cliques of G. 
Assume that P = P{G) and let i be the index of G. Then, the following two conditions hold: 

1. P = = ic(P<inr(z)) (Parent Reconstructability) 

2. G = lc((P<i n r(z)) U {i}) (Child Reconstructability) 

Proof of 1. It is sufficient to observe the following: 

P = lc(C<i) (because i = i{C) and P = P(C')) 

= lc(P<i n r(z)) (by Lemmad]) 

Proof of 2. It is sufficient to observe the following: 

C = lc(C'<i U {z}) (because z = z(C)) 

= lc((P<i n r(z)) U {z}) (by Lemmad]) 

The rationale which allows for the computation of a maximal clique child G is that of reversing 
the parent relation P(-), in the spirit of the Reverse Search Enumeration of Avis and Fukuda [l|. 
Observe that Item 2 of Proposition [ 6 ] pointed out a shape for such an inversion. In fact, in light 
of Proposition [51 given any maximal clique P of G and any z G [rz], it is natural to introduce the 
following notation: 

c{K,i) = ic((A:<i nr(z)) u {z}). 

Proposition [5] tells us that whenever G is a child of P with index z, then G = C(P, z). This means 
that, given P, we are called to characterize all the indices z G [n] such that C(P, z) is a child of P 
with index z. In order to do that, let us proceed by observing the following property enjoyed by 
lc(-), it will turn out to be pertinent in a while. 


Lemma 2. Let G be any n-vertex graph. Let K be a clique of G and let a,b G [n] be two indices 
such that a <b. Then, lc{lc{K<a)<b) = t c{K<a); 

Proof. Since a < b, then K<a C lc{K<a)<b- Thus, by Proposition [U 

lc{K<a) >lex lc(lc(i4:<a)<b). 

On the other way, lc{K<a)<b C lc{K<a)- Thus, by Proposition [T] again, 

lc(lc(it:<a)<b) >lex lc(lc(i4:<a)) = lc{K<a)- 

Since >iex is a total ordering, the observations above imply lc{lc{K<a) <b) = □ 


We are now in position to characterize all the children of any given maximal clique P. 

Proposition 7. Let P be a maximal clique of any given n-vertex graph G = (V,E). 

There exist a child of P having index i if and only if i ^ P U [i(P)] and the following two 
reconstructability conditions hold: 

a. P<, = lc(P<, nr(i))^^ 

b. ic((p<,nr(z))u{z})^^ = p<,nr(i) 

Proof. (=^) Let G be the child of P having index i (which exists by assumption). 

Firstly, we argue that i G P U [z(P)]. Indeed, since P = V{C) = lc(C'<i) and C = lc(C'<i), 
then i ^ P. Moreover, the following equalities show that i > i{P): 

lc(P<i) = lc(lc(C'<i)<i) (by P = lc{G<i)) 

= lc(C'<i) (by Lemma [2|) 

= P (by P = lc(C'<i) 

Finally, we argue that both the (a) and the {b) conditions hold on i. 

Proof of a. By Item I of PropostionjBl we have P = lc(P<i n r(z)). Thus, (a) holds on i. 

Proof of b. By Item 2 of Proposition [51 we have that G = lc((P<i n r(z)) U {z}). By LemmalU 
we have C'<i = P<^i n r(z). These facts imply that the (6) condition holds on i. 

(<^) We argue that whenever both the (a) and the (&) conditions hold on some z ^ PU [z(P)], 
then there exist a child of P with index z. Let C = lc((P<inr(z))U{z}) for some z as mentioned. 
Firstly, observe that C ^ P: in fact, z G C by definition of G but z ^ P by hypothesis. 

Now, we argue that P = lc((7<i). In fact, observe that the following equalities hold: 


P = lc(P<0 

= ic(ic(p<i n r(z))<i) 

= ic(ic(ic((p<, n r(i)) u {z})<,)<i) 
= lc(lc(C'<i)<i) 

= lc(C<,) 


(by i ^ [i(P)]) 
(by (a)) 
(by (&)) 

(by definition of G) 
(by LemmalU) 


To conclude the proof, it is sufficient to check that lc(C<i) = C. 

lc{G<i) = lc(C<i U {z}) (because z G G) 

= lc(lc((P<j n r(z)) U U {z}) (by definition of G) 

= ic((p<inr(i))u{z}) (by (&)) 

= G (by dehnition of G) 


□ 
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Observe that, since lc(K) can be computed from any clique K in 0{m) time by Proposi¬ 
tion |31 it is possible to compute all the children of a given maximal clique P in 0{m'n) time by 
Proposition [71 In fact, it is sufficient to check whether the conditions (a) and (6) both hold on 
the index i, for each i G [n] \ (P U [i(P)]). In this manner, listing each node of To (namely, 
each maximal clique of G) with 0(mn) time delay. In order to improve over the 0{mn) bound, 
Makino and Uno reduced the problem of checking the conditions (a) and (&) of Proposition [7] to 
that of multiplying two n x n square matrices [9|. In doing this, they observed (without proof) 
the following two lemmata. These are a restating of the conditions (a) and (&) of Proposition [71 
We remark that Lemma [3] and Lemma [Hare really at the ground of our reduction to rectangular 
matrix multiplication. 

Lemma 3. Let P be a maximal clique of any given n-vertex graph G = {V,E). 

Then, i G [n] satisfies P<^i = Ici^P^i n r(i))^^ if and only if there doesn’t exist any index 
S [i — 1] \ P such that the following conditions hold: 

a’, j is adjacent to all vertices in P<j; 

a”, j is adjacent to all vertices in P^i D r(i). 

Proof. Assume that for some i G [n] there exists j G [i — 1] \ P satisfying both the {a’) and 
{a”) condition. Then, there exists j' < j such that j' G lc(P<i fl r(f))<i \ P<i, thus implying 
lc(P<i n r(*))<i 7 ^ P<i. For the opposite direction, assume that for some i G [n] there is no 
j G [i — 1] \P satisfying both the (o’) and {a”) condition. Then, lc(P<i nr(f))^. = P^i follows 
by definition of lc(-). This concludes the proof. □ 

Lemma 4. Let P be a maximal clique of any given n-vertex graph G = {V,E). 

Then, i G [n\ satisfies lc{fP^i n r(i)) U {z}) <. = P<^ n r(z) if and only if there doesn’t exist 
any index j G [z — 1] \ (P<i fl r(z)) such that the following condition hold: 

b’. j is adjacent to all vertices in [P<o H r(z)) U {z}. 

Proof Let C = lc((P<;inr(z))U{z}) for some i G [rz]. Assume that there exists j G [z — 1] \ (P<i H 
r(z)) satisfying the {b’) condition. Then, there exists j' < j such that j' G C'<i \ (P<i fl r(z)). 
This implies C<i T {P<i n r(z)). For the opposite direction, assume that there is no j G 
[z — 1] \ (P<i nr(z)) satisfying the {b’) condition. Then, C<i = P<i fl r(z) follows by definition of 
G and lc(-). This implies that C'<i = P<i nr(z) if and only if there is no j G [z — 1] \ (P<i nr(z)) 
satisfying the (b ’) condition. □ 

4 Reduction to Rectangular Matrix Multiplication 

Given any maximal clique P of G = (V,E), consider the problem of computing all the indices 
z G [rz] \ (PU [z(P)]) such that C(P, z) is a child of P with index i. By Proposition [71 this amounts 
to check, for each z G [rz] \ (P U [z(P)]), whether both the conditions (o) and (&) hold on z with 
respect to P. So, let us denote by Lff and the sets of indices z G [zz] \ (P U [z(P)]) that satisfy 
the conditions (a) and (6) (respectively) of Proposition |7| for some given maximal clique P of G. 
Recall that the index z(P) can be computed from P within 0(rz + rrz) time (by Proposition 2]). 
The most expensive step is thus the computation of both and (which, recall, can always be 
done within 0(rnn) time by performing at most rz computations of the lexicographical completion 
lc(-)). Also, recall that this computation can be performed by checking the equivalent conditions 
(a’), (a”) for iff and (b’) for iff given by Lemma |3| and Lemma 0| (respectively). As already 
mentioned in the previous section, in order to compute iff and iff , Makino and Uno relied on 
fast square matrix multiplication, thus sharpening Tsukiyama’s 0(mn) bound to 0{n^) Q. 
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At this point, we shall diverge from their approach in the following manner. 

An Overview. We denote by the problem of computing the sets and with respect 
to some given maximal clique P of any given n-vertex graph G = (V, E). Moreover, we denote by 
B = {Pi, ■ ■ ■ !-P|b|} any batch (i.e., family) of pairwise distinct maximal cliques of G. It is quite 
natural at this point to consider the problem X^, namely, that of solving X^ for every P € B. 
The intuition underlying our approach goes as follows: instead of solving each problem instance 
X^ separately (one after another, by reducing it to square matrix multiplication as in 0 ), we 
propose to group multiple maximal cliques into batches B and to solve the corresponding problem 
X^ (for each batch B), in one single shot, by reducing it to that of multiplying two rectangular 
matrices of size |i3| x n and n x n^. This rectangular matrix product can be performed in an 
asymptotically efficient way by adopting the algorithms devised by Le Gall in 0. As we will 
show in the forthcoming, the optimal size of B turns out to be \B\ = \V\^ = . For this reason, 

we are going to deal with n? x nhy n x n?' matrix products. 

The Reduction. 

By virtue of Proposition [3 Lemma [3] and Lemma SI the problem of solving X^ boils down in 
a straightforward manner to that of solving the following ^Pernel” problem, which is denoted /C® 
and defined in this way: given B as input, for every maximal clique P € B, and for each pair of 
indices {i,j) € [n] x [n], decide whether {i,j) is good with respect to P, namely, decide whether 
there exists u G P^i fl r(i) such that {u,j} ^ E. A solution of K,^ is a mapping which assigns to 
each P G B a. boolean vector, denoted (where gfj G {T, T} for every i,j G [n]), such that: 



T, if the pair {i,j) is good with respect to P; 
T, otherwise. 


The rationale at the ground of these definitions clearly lies within Lemma [3] and Lemma SI In 
fact, with these lemmata in mind, a moment’s reflection reveals that once B i— > [gfj]ij has been 
determined, then, for each P G B, it is possible to solve X^ within time 0{n^). 

In summary, it is not difficult to see that these arguments allow one to solve within time: 


Time[l®] = 0(Time[/C^] +|S|n^). 

Indeed, solving KP turns out to be the time bottleneck for solving X^. The following propo¬ 
sition finally shows how to reduce K,^ to the problem of multiplying two rectangular matrices of 
size \B\ X n and n x n^. 


Proposition 8 (Reduction to Rectangular Matrix Multiplication). Let B = {Pi,... ,P\b\} be a 
batch of maximal cliques of any given n-vertex graph G = (V,E). Consider the |P| x n matrix 
Mg whose k-th row, denoted Xk for every k G [| 6 |], is the characteristic vector Xk = x(Pk)- 
For every i,j G [n], define the following subsets of V: 


Ai = V<i n r(i) and Bj = T{j). 

Let Mg be the n xn^ matrix whose {i,j)-th column is the characteristic vector Xij = x{Ai \Bj). 
Let Mjs^g be the \B\ x matrix obtained by performing the matrix product: 


Mi3,g = Mb Mg- 


For every k G [|P|] and i,j G [n], denote by MB,G[k, ihj)] bbie particular entry of Mb,g whose 
row index is k and whose column index is {i,j)- Finally, let us define: 


r T, ifMB,G[k,{Lj)]>0; 

{ T, otherwise. 


Then, the mapping Pk i-t [g^J^jij, which is defined for every P^ G B, is a correct solution of . 
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Proof. To start with, let us fix P S S and i,j € [n] arbitrarily. Observe that j € V is adjacent 
to all the vertices in P^i 0 r(z) if and only if (P<i O r(z)) \ r(_j) = 0. Equivalently, 

(i,j) G [n] X [n\ is good w.r.t. P P (1 nr(i)) \r(j)^ 0. 

Clearly, (V<i 0 r(i)) \ r(j) depends only on i,j and not on P, so one can safely write this set as 
Ai \ Bj. Thus, in order to assess whether {i,j) is good with respect to P, it is sufficient to check 
whether P 0 (Ai \ Bj) ^ 0. Let k G [|S|] be the index of P in S, i.e., assume that P = Pk € B. 
By definition of Mb,Mo and from the fact that Mb,g = Mb Mg, the following holds: 

P n {Ai \ Bj) ^ 0 if and only if Pb^gI^, (*, j)] > 0. 

This implies the thesis and concludes the proof. □ 


Time Complexity of the Reduction. We now focus on the time complexity of the reduc¬ 
tion described in Proposition |S1 To begin, as shown in .^TCndix El and according to current 
upper bounds on rectangular fast matrix multiplication [^, [7| , the optimal size of the batch B 
turns out to be \B\ = inf. Let us briefly retrace the argument that led us to this result, the 
full details are given in Appendix El Recall from Proposition [ 8 ] that Mb,g can be computed by 
performing an |P| x n by n x rf matrix product. Also recall that, by computing Mb,g, one 
actually solves in one single shot \B\ problem instances, i.e., for every P G B. Let fc G Q be 
such that \B\ = \n^\. Then, computing Mb^g, the amortized time Time[l^] for solving each 
problem X^ for P G P becomes the following, where Time[Mg_G'] denotes the time it takes to 
compute Mb ■ Mq: 


Time X^ = 


/Time[l®] \ ^ /Time[/C®] -k \B\n^ 
V \B\ ) V \B\ 

Time [Mg, g] 2 \ 


= ^( 

= o{n‘ 




Our aim would be to find k G [0, -|-oo)nQ such that a;(fc, 1, 2) — k attains its global minimum value. 
Even though the exact values of u}{k, 1, 2) are currently unknown, one can nevertheless minimize 
functions arising from state of the art upper bounds on tLi(A:, 1,2). These upper bounds have 
been derived within the framework of so-called bilinear algorithms, see e.g. (7|, l^- For instance, 
in this work we consider the bound /hpqs of Huang and Pan Q, and then the bound /lgi 2 of 
Le Gall Q (in particular, /lgi 2 leads to the best upper bound on a;(2,1,2) which is currently 
known). Here, both estimates are be applied in such a way as to bound w(fc, 1,2) — A: from above. 
The corresponding functions, that we aim to minimize, are denoted gi and g 2 . Their behaviour 
is shown in Fig. [31 

The first function gi is defined as follows: 

gi{k) = /HP 98 (fc) - k, for every k G [0, -|-oo), 

here, /hpqs • [ 0 , -|-oo) —^ R is a piecewise-linear function, which was essentially pointed out by 
Huang and Pan in Q, and it satisfies uj{k, 1, 2) < /HPgs)^) for every k G [0, -Too) nQ. An analytic 
closed-form formula for /hpqs is derived in Appendix El Here, we just mention that gi{k) attains 
its global minimum value at fc = 2 , i.e., 

gi{2)= min 51 (fc) = 2.2107878 

fcG[0,+oo) 
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Figure 3: Plot of gi{k) for k G [0, +oo), and of g^ik) for k G {1} U [2, +oo). 


The qualitative behaviour of gi is traced in Fig. [3] with a filled blue colored line. 

In a similar way, the second function 52 is defined as follows: 

52 (fc) = fc/LGi 2 (fc) - k, for every fc G {1} U [2, + 00 ). 

here, 32 takes into account the upper bound fhGi 2 {k) for a;(l, 1,1/fc), which was established by 
Le Gall in Q. We remark that, at the current state of art, the upper bounds of Le Gall apply to 
uj(r, s, t) if and only if r = s. For this reason, when k G [2, + 00 ) fl Q, we applied Le Gall’s bounds 
on uj{k, 1 , 2 ) by relying on the following upper bound: 

uj{k, 1 , 2 ) < uj{k, 1 , k) (for every [ 2 , + 00 ) fl Q) 

= fca;(l, 1 , 1 /fc) (by homogeneity) 

< kfhGi2ik). 

In addition, we applied Le Gall’s bounds on n x n by n x matrix products by considering the 
complexity exponent Cli( 1, I, 2), which is actually one of those explicitly studied by Le Gall in 
Notice that, when k G (1,2), it is not possible to apply the results of Le Gall to bound 
uj{k, 1 , 2 ), because k ^ 1, k ^ 2 and 1 7 ^ 2 so the above mentioned condition (i.e., that r = s in 
ijj{r,s,t)) doesn’t apply in that case. This explains why g 2 {k) is defined on k £ {1} U [2,+ 00 ). 
The qualitative behaviour of g 2 is traced in Fig. [3] with a dashed red colored line. Goncerning 
the global minimization of 52 , it turns ou10 that: 

02 ( 2 ) = min g 2 {k) = 2.093362. 

fcG{l}U[2,+oo) 

In summary, both estimates and Q indicate that the minimum complexity comes at fc = 2, 
namely, they both indicate that the optimal size of the batch B is given by \B\ = n^. 

From now on, let us fix the size of B to be \B\ = n?. Then, by Proposition |51 Mg <3 can be 
computed by performing an x n by n x matrix product. Assuming iV = n^, it is equivalent 
to consider matrix products of type N x by x N. As shown by Le Gall in Q, the 

corresponding complexity exponent, which is a;(l, 1,1/2), satisfies w(l, 1,1/2) < 2.046681. 

^See Appendix [AI for the details. 
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Thus, Mq q can be computed within the following time bound: 


In this way we obtain: 

Time [I®] = Cl(Time[/C®] + = O (lime [M^.g] + 1^1^^) 

= 0(n2-(bl,l/2)^ = 0(^4.093362)^ 

Then, each problem instance for P G B gets solved within the following time bound: 


Ti.,[in=o(^^) 


|S| 


= ^ o(n2 °93362)^ 


.,21^(1,l,l/2)-2'^ _ 

which, we remark, it is an amortized time bound across problem instances. 


An Algorithm for Solving I®. The pseudocode for solving an instance of by reducing it 
to rectangular matrix multiplication is shown below in Algorithm [31 

Algorithm 3: Solving I® by Reduction to Rectangular Matrix Multiplication. 

Procedure SolveJiectangularGL{B) 

Input: A batch of (exactly) maximal cliques B = {Pi,..., P„ 2 } of G = {V, E). 

Output: A vector Lg representing all children of every P £ B, i.e., 

Lb = {(P,listp) I P € B, Vi £ [n] : i £ listp iff C{P,i) is a child of P in 7 g}. 

Ai t— V<i n r(i), for every i £ [n]; 

Bj -f- r(j), for every j £ [n]; 

Mb 4— the x n matrix whose fc-th row is Xk = x{Pk)\ 

Mg t— the n x r? matrix whose (i, j)-th column is xt^j = x{Ai \ Bj)\ 

Mb, a t— Rectangular_Fast_Matrix_Multiplication(M0, Mg); 

T, if Mg^clfe, (i, j)] > 0 


7 

8 
9 

10 

11 

12 

13 

14 


15 

16 


17 


r,Pk 


(for every Pk G B and i,j £ [n]); 


T, otherwise 

Lb •«— a vector of lists (of integers), one listp for each P £ B; 

foreach P £ B do 

Jp 4 — {j £ [n] I j adjacent to all vertices in P<j}; 
ip ■(— compute-i(P); 
foreach i £ [ip + 1, n] Pi N s.t. i ^ P do 
good ■<— T; 

foreach j £ [1, i — 1] n N do 

if gfj = T and ^(j 0 P<i n r(i) and {j, i} £ P)or(j ^ P and j £ Jp)^ then 
good ■<— T; 
if good — T then 

Lb <— add the index i to the list listp; 
retnrn Lg; 


The algorithm works as follows. From line 1 to 5, the variables and the 

matrices Mg, Mq are initialized as they were defined in PropositionjSl Then, at line 5, the matrix 
Mb,g = Mg Mg is computed by invoking a rectangular fast matrix multiplication algorithm, e.g., 
the procedure devised by Le Gall in on input {Mg, Mg)- At this point. Algorithm |3| is in 
position to compute, for each maximal clique P G B, all the good indices i G [n], namely, those 
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that lead to a legitimate child C = C{P,i) of P. This is done by checking the entries of Mq q 
corresponding to all the indices {i,j) (as prescribed by Proposition (3 Lemma [3l Lemma SI and 
Proposition SI ■ Whenever an index i is found good for a maximal clique P G 6, at line 15 of 
AlgorithmSl then i gets added to a list at line 16, denoted listp, which aims to collect all (and 
only) the good indices of P. At the end of the computation, the vector Lg containing all the 
pairs (P, listp) is returned at line 17. This concludes the description of Algorithm [3l Notice 
that, since \B\ = v? and every P € B has at most n children, then the space usage of Pg is 0{n^). 
The following proposition asserts the correctness and the time complexity of Algorithm [3l 

Proposition 9. Let G = {V,E) be an n-vertex graph. Consider any invocation of Algorithm\^ 
on input B, where B = {Pi,..., P„ 2 } is batch of maximal cliques of G. 

Then, the procedure correctly outputs a vector Lg such that: 

Lb = {(P, listp) \ P G B, \/ i G [n] : i G listp iff C{P,i) is a child of P in Tg} ■ 

Moreover, Algorithm{^ always halts within the following time bound: 

Time[Algorithm{^ = ^ q(^ 4 . 093362 )_ 

In this manner, each problem instance gets solved within amortized time: 

Time[lP] = 0(n2“(bl.l/2)-2) ^ ^(^ 2 . 093362 )^ 

which is an amortized time across problem instances. 

Finally, the procedure employs 0(n‘^) working space. 

Proof. The correctness of Algorithm [3] follows straightforwardly from Propostion [3 Lemma |3l 
Lemma Eland Proposition [ 8 l The running time has been analyzed already in the previous para¬ 
graph. The space bound comes from the fact that Mb,g has n'^ entries, being it of size xn^. □ 

A Remark on the Time Delay. We wish to notify that it is actually possible to obtain a listing 
algorithm with a rigorous 0 (n 2 <^(i,i,i/ 2 )- 2 ^ time delay: this can be done by introducing a queueing 
scheme, which firstly collects a certain polynomially bounded amount of maximal cliques of G as 
a bootstrapping phase. The details of the queueing scheme are given in Subsection 16. II 

5 Batch-DFS Backtracking 

In the previous section we described how to reduce to rectangular matrix multiplication. 
Nevertheless, the description of our MCL procedure is not yet complete: since the algorithm 
needs to traverse the entire RS-tree 7 g (consuming only polynomial space), a careful backtracking 
procedure must be taken into account to keep the search process going on. We propose an abstract 
backtracking scheme, named Batch-DFS, which will make the skeleton of our MCL solution. 

An Overvie-w of Batch-DFS. Let T be an n-ary (rooted) tree of height at most n, for 
some n € N. We denote by Kq the root of T. Assume we are given a procedure children(), 
which takes as input a batch B of nodes of T and returns as output a vector Lg containing all 
the children C of P, for every P G B. Notice that 0 < |Lg| < \B\n. Assuming we aim to visit 
all nodes of P, meanwhile consuming only a polynomial amount of memory in n, then our first 
choice would have been to perform a DFS on T. Nevertheless, we now show that it is possible 
to explore 7” by (somehow) grouping the search of new children nodes into batches B, provided 
that: ( 1 ) the size of each batch is polynomially bounded in n, and ( 2 ) the backtracking phase is 
performed on a LIFO policy. The pseudocode given below encodes Batch-DFS, which will be our 
reference model of backtracking for directing the search process towards yet unexplored nodes. 
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Algorithm 4: Batch Depth-First Search. 


1 

2 

3 

4 

5 

6 
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Procedure batch_DFS{Ko, childrenO, B) 

Input: the root Ko of T, where T is a tree having height n € N; also, a procedure 

childrenO which takes as input a batch B of nodes of T and returns as output a 
vector Lb containing all the children C of P for every P £ B\ finally, the capacity 
P G N, B > 0, of any batch B. 

Output: a listing of all the nodes K of T. 

Sbt ■£- {Ko};// Sbt is a backtracking stack, implemented as a LIFO stack and initialized with Kq 
while Stt yf: 0 do 

B 0;// B is a batch of nodes of T with capacity B, initialized to be empty 
while \B\ < B and Sbt 7 ^ 0 do 

P -f- pop_f rom_top(Sbt);// remove one single node P from the top of Sbt 
B^BU{P}; 

print(P);// print P as output 
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Lb ■£- children(B);// the vector of all children C of P, for every P £ B 
Sbt t— push all elements of Lb on top of Sbt; 


Description of Batch-DFS. To start with, Algorithm |4] takes the following input: the root 
Kq of T; moreover, a procedure childrenO (which is supposed to take in input a batch B of 
nodes of T, and to return a vector Ljg containing all children C of P, for every P £ B); hnally, 
a positive number B £ N, representing the hxed capacity of any batch B collected at lines 4-7. 
The procedure aims to provide a listing of all the nodes K of T. 

Going into its details. Algorithm 2] works as follows. A LIFO stack Sbt is maintained in order 
to direct the search of yet unexplored nodes. Initially, Sbt contains only the root Kq of T (line 1). 
Then, the procedure enters within a while-loop, which lasts until Sbt 7 ^ 0 at line 2. Herein, the 
procedure tries to collect a batch B of exactly \B\ = B nodes, picking out new nodes (as needed) 
from the top of the stack Sbt at line 5. Every node P that is removed from Sbt at line 5, and 
then inserted into B at line 6 , is also printed out at line 7. Observe that even if the size of the 
batch fails to reach the amount B, i.e., even if it happens “\B\ < B and Sbt = 0” at line 4, then 
Algorithm 2] moves on anyway. At line 8 the procedure childrenO is invoked on input B, in 
order to generating the vector Lg containing all children nodes C of P, for every P £ B. Soon 
after, each of such child node C is pushed on top of Sbt (see line 9). 

As already mentioned. Algorithm 0] halts as soon as the condition “Sbt = 0” holds at line 2 . 

An Analysis of Batch-DFS. The following propositions starts our analysis of Batch-DFS 
by showing that every node A of T is eventually outputted (w/o repetitions). 

Proposition 10. Let P he an n-ary tree, having height at most n G N and rooted in Kq. Consider 
any invocation of Algorithm^^on input {Kq, childrenO, B) , where i? G N, P > 0. Then, every 
node K of P is eventually outputted (at line 7), without repetitions. 

Proof. • Fact 1. We first argue that every node A of T is eventually outputted. 

Let A be any node of P. The proof proceeds by induction on the distance dist 7 -(Ao, A) 
between the root Kq and A. As a base case, it is easy to check (from the pseudocode of 
Algorithm 0]) that the root Kq is printed at the first iteration of line 7. Now, assume that 
every node A having distance at most d — dist 7 -(Ao, A) from Kq is eventually printed out 
at some iteration of line 7. Let A be any node at distance dist 7 -(Ao, A) = d -F 1 from Kq. 
Also, let V{K) be the parent of A. Since dist 7 -(Ao, P(A)) = d, then at some iteration of 
line 7, P(A) is outputted, hence it is also added to B at line 6 . Subsequently, at line 9, all 
children of V{K) (and thus, in particular, A) are added on top of ^bt- Eventually, at some 
future iteration of line 5, A must be picked up from the top of Pbt- As that point, A must 
be outputted at line 7. Since A was arbitrary, the thesis follows. 
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• Fact 2. Each node K can’t be printed out twice. 

Indeed, when K is printed at line 7, it is also removed from S'bt, and all of its successors 
are added on top of S'bt: this is the only way in which a node can enter within Sbt- Since 
T is a tree, the thesis follows. 

□ 

Proposition 11. Let T he an n-ary tree, rooted at Kq. Consider any invocation of Algorithm\^ 
on input (Kq, childrenO, B) , where B G N, i? > 0. In particular, let ij he any iteration of the 
while-loop at line 2 of Algorithm^ Let he the corresponding batch B which is given in 
input to childrenO during tj at line 8. Then, the whole execution of ij takes time: 

Time^ij iteration of line 2j = Time[children(B^‘'^^)^ + 0(n |). 

Proof. The thesis follows directly from the definition of line 9 and from the fact that T is an n-ary 
tree, so that the vector L^^j) (at line 8 of Algorithm^]) contains at most n elements. □ 

In the next proposition we argue that Sbt can grow up its size at most polynomially in n, B. 
Before proving that, we shall introduce some notation. 

Let us consider any two consecutive iterations of the while-loop at line 2 of Algorithm 21 
say the iterations ij and ij+i- For any j > 1, let be the batch B which is given in input to 
childrenO at line 8 and during ij] moreover, let be the vector of nodes returned by the 

invocation of children(,BFj)) at line 8 during ij. We shall say that Algorithm |4] backtracks at 
the ijj-i iteration whenever it holds that: .ghi+i) L^o^), i.e., whenever, at the ij+i iteration of 
line 8, the batch contains some nodes that were not pushed on S'bt at the ij iteration of 

lines 9 ~ 11, but during some previous iteration i^ (for some k < j) instead. 

Proposition 12. Let B he an n-ary tree having root Kq and total height at most n € N. Consider 
any invocation of Algorithm^ on input {Kq, childrenO, B). Throughout the whole execution, 
the backtracking stack Sa can grow up to contain at most n^B nodes. For this reason. Algorithm^ 
consumes at most 0(n^B -\- Space^childrenO^^ space, where Space\childrenO\ denotes the 
worst-case space consumed by any invocation of childrenO. 

Proof. Since T is n-ary and \B\ < B (because of line 4 of Algorithm Id]) , then each batch B has at 
most nB children. Since Sbt is accessed adopting a LIFO policy, and since T has total height at 
most n, the following fact holds: until the first backtrack doesn’t happen, Sbt can grow its size up 
to nB elements at most n times. Therefore, Sbt can grow its size up to n^B elements, before it 
needs to backtrack at some iteration of the while-loop at line 2. As soon as Algorithm 2] starts 
to backtrack, say at the ij+i iteration, then Sbt shrinks its size, collecting (at lines 4 ^ 7) a batch 
g('-3+i) that must contain some nodes which had been pushed on Sbt at some previous iteration 
ik of lines 9 11 (for some k < j). We now observe that, at the ij+i iteration of line 8, the stack 

Sbt must contain at most as many elements as it contained at the end of the ik iteration. For 
this reason, Sbt has still no way to grow its size up to more than n^B, by going down the levels 
of T once again after that a backtracking occurred. The same observation continues to hold for 
any possible subsequent backtracking. In this manner Sbt can grow its size up to n^B nodes at 
most during the whole computation. □ 

We now aim to show another crucial property of Algorithm 21 one that turns out decisive for 
adopting Batch-DFS in order to speed-up the MCL problem. 

In order to prove this fact, it is convenient to introduce a three-way coloring scheme on T. 
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A Three-Way Coloring on T. Consider any invocation of Algorithm 0] on the following 
input (Aq, children0, i?), where Kq is the root of T. At the beginning of the execution, it is 
prescribed that all nodes A of T are colored in white. As soon as a white node A of T is pushed 
on top of 5'bt (either at line 1 or at line 11 of Algorithm 0]), then A changes its colour from white 
to green. Stated otherwise, at each step of Algorithmic all the nodes in S'bt are green. Finally, 
as soon as any A gets removed from S'bt at line 5, then A changes its colour from green to black. 

Observe that, since by Proposition HC every node of T is eventually pushed on Sbt, and then 
removed from it, exactly once, then every node of T eventually transits from white to green, and 
then from green to black. Moreover, black nodes remain such until the end of the execution. 

We proceed by observing an invariant which is maintained by Algorithmic 

Lemma 5. Let T be an n-ary tree having root Aq and total height at most n S N. Consider 
any invocation of Algorithm^ on input (Aq, childrenO, B), and let Ui be any step of execution 
of line 3. Let us denote by minimum distance between the root Kq and any node of T 

which is green at step Oi, i.e., 

Careen = ’aiin { dist-j-{KQ, A) I A G T and A is green at execution step ct^}. 

Then, at step Oi every node K G T such that distp{KQ, K) < either green or black but 

it is not white. 

Proof. Assume, for the sake of contradiction, that at step ai there exists a white node A^, in T” 
such that dist7-(Ao, Ay,) < fg^een- Recall that, at the beginning of the execution, the root Aq of 
T turns green at line 1; hence, at any subsequent step, Kq must be either green or black. Thus, 
at (Ti, there must exist at least one ancestor of A„ which is either green or black but not white, 
because there is a path from A^, to Kq. Now, let A be the ancestor of A^, which is either green 
or black and such that its distance from Kq is maximum among all of those ancestors of Aiu that 
are either green or black. What is the colour of A at step ai, is it green or is it black? 

Notice that A is not green at Ui; in fact, since A is an ancestor of A^,, then: 

dist7-(Ao, A) < dist7-(Ao, A„) < 


whereby a green colored A would contradict the minimality of Still, A is not even black 

at ap, otherwise, all the children of A would have been colored in green at some previous step 
of the algorithm, because of lines 8 ^ 11 of Algorithm 01 thus contradicting the maximality of 
dist7-(Ao, A). No colour is actually possible for A at a^, this leads to a contradiction. 

Indeed, there exists no such a white node A^,. This implies the thesis. □ 

Proposition 13. Let T be an n-ary tree having root Kq and total height at most n £ N. Consider 
any invocation of Algorithm^ on input {Kq, childrenO, B) . Then, the total number of steps of 
execution of line 4 in which the condition “Sn = 0” holds is always less than or equal to n. 

Proof. Let’s consider any generic step of execution of line 4, say step aj, such that “S'bt = 0” 
holds. Let ai, for some i < j, be the last step of execution of line 3 which precedes aj. Stated 
otherwise, we are considering a sequence of execution steps, cr^,crnext.step(i)) ■ ■ ■ ^erj, where: 

• the starting step ai corresponds to an execution of line 3; 

• crnext_step(i) i • ■ • marks the (immediately following) entrance of the computation process into 
the while-loop at line 4; 

• aj corresponds to the (subsequent) exhaustion of the while-loop at line 4; i.e., aj is the 
hrst step of execution of line 4, subsequent to ai, such that the condition “S'bt = 0” holds. 


18 


By Lemma [H at step tr^, every node K such that dist7-(_firo, K) < must be either green or 

black, and notice that there must exist at least one such green node at the step of execution 
of line 3 (otherwise we would have had S'bt = 0 just before at line 2). Since, at step Cj, S'bt = 0 
holds by hypothesis, then every node K such that dist7-(isro, K) < must be turned black 

at (7j. Stated otherwiese, all nodes having distance ig^een from Kq that were green at step tJi 
must be turned black at step aj. In this manner, we see that at step aj yet another level of depth 
in T has been loosely speaking “turned-off” completely and forever. 

Since T has total height at most n, the thesis follows. □ 


6 An Asymptotically Faster Algorithm for MCL 

The present section offers two algorithms for MCL. Our core procedure is Algorithm[5] it provides 
a listing of all the maximal cliques of any given n-vertex graph with a time delay polynomial in n. 
However, due to technical reasons (related to Proposition [5]), the procedure exhibit a time delay 
that is in some sense “amortized” across n? output operations. In subsection 16.II Algorithm [6] will 
be introduced in order to overcome this issue, thus achieving the time delay stated in Theorem[21 
The pseudocode of Algorithm [S] is presented here below. 

Algorithm 5: Listing all Maximal Cliques. 

Procedure listJ4C{G) 

Input: A graph G = {V, E), where |P| = n. 

Output: A listing of all the maximal cliques K of G. 

Kq c— construct the lexicographically greatest maximal clique Aq; 
batch_DFS(Ao, childrenO , n^);// invoke Algorithm |4] 

SubProcedure children{B) 

Input: A (non-empty) batch B = {Pi,..., P„ 2 } of \B\ < maximal cliques of G. 

Output: The vector B' of all the children of B, i.e., 

B' = {(P, listp) \ P G B, Vi £ [n] : i G listp iff C(P,i) is a child of P in To}, 
if \B\ = v? then 

B' c- Solve_Rectcingular_r(B);// invoke Algorithm [3] 
else 

L <— compute all children of B with Makino-Uno’s procedure [^; 

return B' ; 

Description of Algorithmic At line 1, the lexicographically greatest maximal clique Kq 
of G gets constructed. At line 2, Algorithm |4] is invoked on input {Kq, childrenO, n^). The sub¬ 
procedure children0 is defined as follows. It takes in input a (non-empty) batch B containing 
\B\ < v? maximal cliques of G, and it aims to return a vector Lb containing all (and only) the chil¬ 
dren of B, i.e.. Lb = {(P, listp) \ P G B, Vi G [n] : i G listp iff C(P, i) is a child of P in Tel- 
Given B in input, the course of actions within childrenO dependes on the size \B\'. 

• if \B\ = n^, then Lg is computed by invoking Algorithmon input B at line 2; 

• otherwise, if 0 < |S| < n^, then Lg is compnted with the original algorithm of Makino and 
Uno Q (the one having an OinP) time delay complexity). 

Then, Lg is returned as output at line 5 of childrenO. There’s still one missing detail. Recall 
the functioning of Algorithmic at line 5, pop_from_top() is assumed to retrieve one single 
maximal clique from Sbt (and not a pair (P, listp). For this reason, a careful implementation 
of pop_from_top() must be taken into account. It may go as follows. Firstly, pop_from_top() 
accesses to the head of the stack say (P, listp), without actually removing it from Sbt- 
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There, it removes the first element of listp, say i, thus reducing the size of listp by one 
unit. At this point, (P, listp) is removed from the top of S'bt if and only if listp has become 
empty by removing i. Finally, pop_from_top() constructs the maximal clique C(P, i), by invoking 
Algorithm [T] on input {P^i H r(i)) U {f}. Notice that any invocation of popJrom.topO takes 
time O(n^), which is due to Algorithm [TJ This concludes the description of pop_from_top(), and 
thus that of Algorithm [5l The following proposition asserts its correctness. 

Proposition 14. On input G = (V,E), the procedure Algorithmic provides a listing of all the 
maximal cliques of G without repetitions. 

Proof. Observe that Algorithm [5] invokes Algorithm |4] at line 2. Also, Propositionimplies that 
the subprocedure childrenO of Algorithm [5] is correct. The thesis follows by ProoostionfTOl □ 

The following proposition asserts the time complexity of Algorithm [5j 

Proposition 15. Given any n-vertex graph G = (V, E) as input, Algorithmic outputs the first x 
maximal cliques of G within the following time bound, for any x G N; 

Tfirst^ = 0(n“+3 + = 0(^5.3728639 ^ a,„2.093362^ _ 

Proof. The proof is divided into four steps. There, lj will denote any generic (but fixed) iteration 
of the while-loop at line 2 of Algorithm 01 

Fact 1. There exist at most n iterations Lj of the while-loop at line 2 of Algorithm 0] such that 
0 < \ and for all other iterations ij of line 2 of Algorithm0J it holds = n^. 

Proof of Fact 1. Since B = n^, we have that 0 < holds if and only if the 

condition “S^t = 0” holds at line 4 of Algorithm 0] during ij . By Proposition [131 this may 
happen at most n times throughout the whole execution of Algorithmic 

Fact 2. All maximal cliques of B^''^'^ are outputted with 0{n^) time delay. 

Proof of Fact 2. Notice that during Lj all maximal cliques in are outputted at line 7 of 
Algorithmic Just before, at line 5, popJroni_top() needs to make an invocation to Algo¬ 
rithmic (as we had already observed in the description of Algorithmic) By Propostion 01 
this latter invocation takes at most 0(n?) time. 

Fact 3. If |Shj)| = 77 , 2 ^ then the whole execution of bj takes time . 

Proof of Fact 3. This follows by Proposition [IT] and Proposition [C 

Fact f. If |6hj)| < 77 , 2 ^ then the whole execution of bj takes time 0(n‘^|6hj)|) = 0(n“+^). 

Proof of Fact f. This follows from PropositionfTC and from existence of the 0(rP) procedure 
devised by Makino and Uno in [^. 

By Facts 1^4, the above mentioned time bound on Tfirst_a: follows. □ 

To conclude, the space usage of Algorithm 01 is analyzed below. 

Proposition 16. The space usage of Algorithmic is 0{n'^). 

Proof. By Proposition[TC the space usage of Algorithm01is 0(n^i3 -I- Space [children ()]), where 
Space[children()] is the worst-case space consumed by any invocation of childrenO. There 
is still one detail that it is worth stating. Even though to represent a maximal clique requires 
0(n) space in memory, recall that within the backtracking stack 5'bt of Algorithm 01 we have 
choosen to represent all the children of any generic maximal clique K by keeping in memory the 
pair (P, listp), where listp is a list of integers having length at most n. This fact implies 
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that, in order to store all the 0{nB) children of any batch B oi B maximal cliques, we need only 
0{nB) space. For this reason, the stack consumes only 0{'n?B) space throughout the whole 
execution of Algorithm 21 as shown by Proposition [TH in the abstract setting. Now, concerning 
the MCL problem, we have B = (see line 2 of Algorithm O. Moreover, by Proposition [HI the 
following holds: Space[children()] < Space[A/gorithm [3] = 0{n^). These facts imply that 
the space usage of Algorithm [5] is O(n^). □ 

Theorem [T] follows, at this point, from Proposition [TTl [T51 and [TO 

6.1 An Time Delay Algorithm for MCL: 

The Proof of Theorem 

This subsection describes Algorithm [51 which is the procedure mentioned in Theorem [5] 

The corresponding pseudocode follows below. 

Algorithm 6: Listing all Maximal Cliques as in Theorem [2] 

Procedure soIve_MCL(G) 

Input: A graph G = {V, E), where \V\ = n. 

Output: A listing of all the maximal cliques K of G. 

Tdeiay •<— cq = cq ]; // for some sufficiently large constant co > 0 

T •<— = Cl ; // for some sufficiently large constant ci > 0 

Q •<— 0;// let Q be an empty queue 

i e- boot(Q, T,G)\I! the bootstrap aims to fill Q up to containing T elements 
counter •<— 0; 
while CTi A Oeni do 

// i.e., while Ui is not the last step of list_MC(G)’s computation. 

(Ti+i ■<—next_step(list_MC(G), CTi); 
counter <— counter + 1; 

if (Ti+i = print (K) at line 1 of Algorithm^then 

Q append_on_tail((3, A); //do not perform the actual printing, but append K to the 
tail of Q instead 

if (IQI > 0 and counter > Tdeiay ) or |Q| > T + then 
K remove_from_head((5);// remove the head K Q 
print(i^);// perform the actual printing of A" 
counter -f- 0; 

i 

while \Q\ > 0 do 

K remove_from_head((5);// remove the head A" of Q 
print(A');// perform the actual printing of A" 
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Algorithm [6] takes as input an n-vertex graph G = (V^E)^ and provides a listing of all the 
maximal cliques K of G. An overview of the algorithm follows. As a Turing Machine can be 
programmed in order to simulate each step of the computation of any another Turing Machine, 
Algorithm [^performs a step-by-step simulation of the computation performed by Algorithmic] on 
input G. Given a generic step of such a computation, say we shall denote by di+i the next step 
within the sequence of all steps of the computation. In particular, we shall adopt the notation 
(Ji+i ■<— next_step(list_MC(G),ai). Stated otherwise, we are assuming that any invocation of 
Algorithm [5] on input G leads to the following sequence of steps of computation: 

(cTo, (Ti — next_step(list_MC(G'), (Jo), (J2 = iiext_step(list_MC(G), ai), ... , (Tend) 
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where each cr^ represents the execution of a particular line within the corresponding reference 
pseudocode. The rationale of this being that, at each one of those steps of execution cTj, Algo¬ 
rithm [S] assesses how to best manage a queue Q whose aim is to collect a suitable number of 
maximal cliques of G in order to sustain the time delay to scheme. 

At each ai, the course of actions taken by Algorithm |6] on Q depends on: 

1. the current size of Q, i.e., the number of maximal cliques that are inside Q at step (7^; 

2 . the numeric value of the current step-counter i reached by ai] 

3. the particular lin^l of Algorithm [6] that is executed at step ai ; 

In particular, every print (K) operation performed by Algorithm [5] is hooked by Algorithm |6l 
where the idea there is that of appending K to the tail of Q without printing it (immediately) as 
output, but with the intention to perform the actual printing operation later on, in such a way 
as to keep the time delay under 

Going into the details. Algorithm [5] is organized into three phases: (1) initialization, (2) 
bootstrapping, and (3) listing. These are described next. 

1. Initialization Phase. To start with, some variables gets initialized. At line 1, Tdeiay = 
Co= Comarks the time delay that the procedure aims to sustain. 
Here, Cq is some sufficiently large absolute constant (whose magnitude will be clarified in 
the proof of Proposition IT^ . At line 2, T = ci = ci is the 

number of maximal cliques that the bootstrapping phase will aim to collect (the magnitude 
of Cl will be also clarified in the proof of Proposition [151). Finally, at line 3, the queue Q is 
initialized to be empty. 


Algorithm 7: The Bootstrapping Phase. 
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SubProcedure boot{Q,T,G) 

Input: A reference to Q, a threshold T on the size of Q, the input graph G. 

Output: the index of the current computation step a;, that is reached after the bootstrap. 
(To •<— the starting step of Algorithm's computation sequence on input G. 
f •<— 0; 

while IQI < r or Oi ^ line 2 of Algorithm^do 
(Ti+i <—next_step(list_MC(G), (Ti); 
if (Ji+i = print (K) at line 7 of Algorithm^then 

Q ■<— append_on_tail(Q, A); // don’t execute print (A), append K to Q instead 
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i i — i 1] 


8 return i] 


2. Bootstrapping Phase. This phase begins (at line 4 of Algorithm 15)) by invoking Algo¬ 
rithm |3 on input {Q,T, G). The objective is to collect at least T maximal cliques inside Q. 
For this reason. Algorithm |3 starts a step-by-step simulation of Algorithm [5] on input G. 
The simulation starts, at line 1, by considering the first step (Tq of the computation. The 
subsequent steps of the computation are simulated within the while-loop defined at line 3 
of Algorithm^ by invoking next_step() at each iteration of line 4. Whenever Algorithmic 
performs a print (A) operation of some maximal clique K (which is checked at line 5 of 
Algorithm [7]) , then A is appended to the tail of Q at line 6 (without performing the actual 
printing operation). After that Q gets to contain at least T elements, Algorithm |3 extends 
the simulation of Algorithmic still for awhile. In particular, the simulation is extended until 

®i.e., the particular line within the pseudocode of Algorithm [5] and Algorithm |4] 
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the end of the current iteration of the while-loop at line 2 of Algorithm 0] which it is being 
simulated (for this reason, the condition “(7^ ^ line 2 of Algorithm 0]’ is checked at line 3 
of Algorithm [7]) . Finally, Algorithm [7] halts at line 8, by returning the current step counter 
i that has been reached so far. 

3. Listing Phase. The listing phase begins soon after, at line 5 of Algorithm 01 where a 
counter variable is initialized. Then, Algorithm [6] enters within the while-loop at line 6, 
whose purpose is that of continuing the same simulation of Algorithm [5] that Algorithm [7] 
had begun by bootstrapping. This time the simulation process will continue until the end, 
i.e., until last step (Tend of Algorithm [5l For this reason, the condition “cTi ^ (Tend” is checked 
at each iteration of line 6. Observe, that each step ai gets iterated to (Ti+i at step 7, where 
next_step() is invoked. Soon after, the counter variable is incremented at line 8, and 
then the current execution step (Ti+i is inspected at line 9: if di+i consists into a print (AT) 
operation (which may have been executed only at line 7 of Algorithm!?]), then the maximal 
clique K is appended to the tail of Q at line 10, and the actual printing operation is 
postponed. At line 11 the procedure checks whether it is time to execute an ouput printing, 
and this happens if and only if any one of the following two conditions is met: 

• Q is not empty and the simulation of Algorithm 0] performed more than Tdeiay steps 
since the last time that a printing operation was executed at line 13 of Algorithm [6] 
(to verify this, the condition “\Q\ > 0 and counter > Tdeiay” is checked at line 11). 

• Q contains more than T + elements (for this reason, the condition “|(5| > T + 
is checked at line 11 as well). 

If one of the above conditions is met, then a maximal clique K is removed from the head of 
Q at line 12, and it is outputted by executing print (K) at line 13. In this case, the counter 
variable is also reset to zero at line 14. At line 15, the step counter i gets incremented (so 
that to prepare the ground for the next step of the simulation). When the while-loop 
at line 6 is completed (i.e., when the simulation of Algorithm 0] reaches (Tend) then every 
maximal clique that is still inside Q is removed from it at line 17 and outputted soon after 
at line 18 (for this reason, the condition “|(5| > 0” is checked at line 16 of Algorithm 0]) . 

This concludes the description of Algorithm 0] 

We are now in position to prove Theorem 1. We shall go through a sequence of propositions. 

Proposition 17. On input G = {V,E), the procedure Algorithni\^ provides a listing of all the 
maximal cliques of G without repetitions. 

Proof. Recall that Algorithm 0] performs a simulation of Algorithm 01 and that it hooks all of 
the corresponding output printing operations. Also recall that, by Proposition 1141 Algorithm 0] 
outputs every maximal clique of G exactly once. This implies that every maximal clique of G 
must enter within the queue Q exactly once, either at line 6 of Algorithm [7] or at line 10 of 
Algorithm 01 With this in mind, from lines 12-13 and lines 17-18 of Algorithm 0] it follows that 
whenever a maximal clique K is removed from Q, then K is also printed out. Notice that, at 
lines 16-18 of Algorithm 01 Q is emptied anyhow. These facts imply the thesis. □ 

Proposition 18. AlqorithmYA (i.e., the bootstrapping phase of Algorithmic always halts within 
time: = 0(n-+3) = 0(n^-3728639j ^ 

Proof. Recall that Algorithm [7] keeps the simulation of Algorithm 0] going until the queue Q 
doesn’t get to contain at least T = = (3(7i3.27950i9^ elements. 
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By Proposition [121 Algorithm [5] collects T elements within the following time bound: 
= 0{n‘^+^) = 


Thus, Algorithm [3 also halts within time Tboot = 0(n‘^+^) = . □ 

Proposition 19. The time delay between the outputting of any two consecutive maximal cliques 
in Algorithm\^is: T^^iay = . 

Proof. Observe that every printing operation performed by Algorithm [5] is executed either at 
line 13 or at line 18. The time delay between any two consecutive iterations of line 18 is only 0(1). 
Thus, we shall focus on proving the thesis with respect to line 13. Let’s recall the functioning of 
Algorithm |6] and that of Algorithm |4l Consider any generic iteration of the while-loop at line 2 
of Algorithm m say the Lj iteration. Also, recall that Algorithm 0] firstly collects a batch 
of maximal cliques, through the execution of lines 4-7. Each maximal clique which is added to 
at line 6 would also be printed out at line 7 of Algorithm |4l However, all of these output 
printings are hooked at line 9 of Algorithm [51 Thus, each maximal clique K within is not 
printed out immediately (i.e., at the time of the hooking), instead K is added to Q soon after at 
line 10 of Algorithm |6l The rest of the analysis is divided in two cases. 


• Case 1. If = n^, then (as already observed in Fact 3 within the proof of Propo¬ 

sition (TSl) the simulation of the ij iteration of the while-loop at line 2 of Algorithm [4] 
takes time at most con^‘^*^^d,i/2) -^^hen n is large enough and for some absolute constant 
Co > 0 (whose magnitude highly depends on the rectangular matrix multiplication algo¬ 
rithm employed at line 5 of Algorithmic]). As already observed in Fact 2 within the proof 
of Proposition [Id all maximal cliques in are outputted with Oiri^) time delay. These 
facts imply that Algorithm [6| can remove one element from Q (at line 12) and print it out 
(soon after at line 13) every Tdeiay = cq steps, without ever emptying Q for 

this (provided cq is a sufficiently large constant, and provided n is large enough); stated 
otherwise, during the simulation of ij, at each iteration of line 11 it must hold \Q\ >0 
whenever counter > Tdeiay Thus, Algorithm [5] actually outputs a maximal clique of G at 
line 13 every Tdeiay = steps. 

As a side note, this also implies that at the last step of any such Lj the queue Q must 
contain at least as many elements as it contained at the first step of Lj. 


Indeed, observe that: 


Con2-(i44/2) ^ , 

Co|-„2a;(l.l,l/2)-2^ “ ” 




• Case 2. If < n^, then (as already observed in Fact 3 within the proof of Proposi- 

tion fTSl) the simulation of the Lj-th iteration of the while-loop at line 2 of Algorithm[4|takes 
time at most CQn“|Sf''^^| < when n is large enough and for some absolute constant 

Cq > 0 (whose magnitude highly depends on the square matrix multiplication algorithm that 
is employed at line 4 of childrenO within Algorithm [S|) . By Proposition Hd this Case 2 
can occur at most n times during the whole simulation of Algorithm 01 so that the total 
(aggregate) time complexity that may be consumed (across all such possible occurrences of 
Case 2) is at most Cgn'^ISf''^^|n < Con“+^. Now, recall that Algorithm [7] had collected at 
least T = Cl |'77,‘^-2i^(i,i,i/2)-i-5i maximal cliques inside Q, for some absolute constant ci > 0. 
Also recall that, at each occurrence Lj' of Case 1, the queue Q must contain at the last step 
of Lj> at least as many elements as it contained at the first step of Lji. Finally, let us assume 
(without loss of generality) that we had picked ci such that ci > Cq/cq. These facts imply 
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that, during any occurence ij of Case 2, Algorithm [5] can remove one element from Q (at 
line 12) and print it out (soon after at line 13) every r^eiay = Cq g^^gpg^ with¬ 

out ever emptying the queue Q for this (provided that co,ci are sufficiently large absolute 
constants, and provided that n is large enough); stated otherwise, during the simulation 
of 6j, at each iteration of line 11 it must hold \Q\ > 0 whenever counter > Tdeiay Thus, 
also in this Case 2, Algorithm |6] actually prints out a maximal clique of G at line 13 every 

Tdeiay = 0(n2-(l.l,l/2)-2^ ^ o(„2.093362^ 

Indeed, observe that: 


^a;+3 


CgP^2a;(l,l.l/2)-2^ 


< c^yi‘^-2<^(l,l.l/2)-r5 < 


Since there are no other cases to take into account, this suffices to conclude the proof. □ 

Proposition 20. The overall space usage of Algorithm\^ is 0(n‘^“2“(i.i,i/2)-i-6^ _ 

Proof. Recall that Algorithm [5] performs a simulation of Algorithm [SJ which consumes at most 
O(n^) space by Proposition llbl The queue Q maintained by Algorithm [5] can grow up to contain 
at most T + nf elements (because of lines 11-14 of Algorithm |6l). Since T = 0(n‘^“2i^(i,i,i/2)-i-5^ _ 
(;^(^3.2795019) definition, and each maximal clique has size 0(n), then the overall space usage 
of the procedure is 0(n‘^-2‘^(i4.i/2)-r6 ) ^ q(^4.2795019)_ □ 

At this point. Theorem [2] follows from Proposition [HI Proposition [19] and Proposition [SHI 


7 Conclusion 

In this paper, we improved the asymptotics for the exploration of the RS-tree associated to MCL 
by grouping the offsprings’ computation in a novel manner. In summary, our idea was to rely on 
rectangular fast matrix multiplication in order to compute all the children of maximal cliques 
in one single shot. The major open question, on this way, is that to understand whether or not 
the MCL problem admits 0(n^+°(^^) time delay algorithms that meanwhile maintain both the 
bootstrapping time and the working space polynomial in n. 
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A Appendix A: Analysis of the Time Complexity (Extended 
Version) 


In this section it is shown that, according to the current upper bounds on rectangular fast matrix 
multiplication in , the optimal size of the batch of maximal cliques B turns out to be |,B| = 

To start with, recall from Proposition |S] that Mg <3 can be computed by performing an \B\ x n 
by n X matrix product. Also recall that, by computing Ms,a, one solves in one single shot \B\ 
problem instances, i.e., for every P € B. Let fc G Q be such that \B\ = \n^\ ■ Then, computing 
M]g G, the amortized time Time[l^] for solving each problem for P £ B can be bounded as 
follows, where T±me\Mg,G\ denotes the time it takes to compute the matrix product Mg • Mg'- 


T ime I = 


1 = p/ Time[Z^] N ^ / Time[/Cg] + |g| 

J V |6| ; V ISI 


\B\ 


= O 
= O 


/Time [Mg^c] 




uj{k,l,2)—k 


Our aim would be to find k G [0, +oo)nQ such that uj{k, 1, 2) — k attains its global minimum value. 
Even though the exact values of u}{k, 1, 2) are currently unknown, one can nevertheless minimize 
the functions that arise from the currently known upper bounds on uj{k, 1,2). In this work we 
consider the bound /hp98 of Huang and Pan Q, and the bound /lgi2 of Le Gall Q. In particular, 
/lgi2 gives the best upper bound on a;( 2 , 1 , 2 ) = 2 a;(l, 1 , 1 / 2 ) which is currently known in the 
literature. These bounds have been obtained within the framework of bilinear algorithms 0,0. 
Indeed, presently and historically, all the known algorithms supporting the record asymptotic 
complexity estimates for matrix multiplication have been devised as bilinear algorithms. In such 
framework, the complexity bounds are expressed in terms of the minimum number of bilinear 
multiplications needed for the computation, as it can be shown that the number of arithmetic 
additions or scalar multiplications affect the cost only in a negligible way, see e.g. 00. Stated 
otherwise, it is known in the literature 0 that the minimum number R[m,n,p) of bilinear 
multiplications used in all bilinear algorithms for m x n hy n x p matrix multiplications is an 
appropriate measure for the corresponding (arithmetic) asymptotic complexity C(rn,n,p). 

The following equalities are also known in the literature, see e.g. 0 . 

uj{ar, as, at) = auj(r, s, t) {homogeneity) 

uj{r, s, t) = uj{r, t, s) = U}{s, r, t) = uj{s, t, r) = uj{t, r, s) = uj{t, s, r) 


Finally, we shall express our bounds by considering the following two quantities: 


a = sup{fc G Q I a;(l, 1, fc) = 2} > 0.30298 
w = a;(I,l,I) < 2.3728639 


(ref. 5 1 


(ref. 




A.l The upper bounds of Huang and Pan (1998). 

The first function gi is defined as follows: 

gi{k) = /HP 98 (fc) - k, for every k G [0, + 00 ), 

here, /hp 98 • [0, + 00 ) —>■ R is a piecewise-linear function, which was essentially pointed out by 
Huang and Pan in 0, it satisfies u}{k, 1, 2) < fnpgsik) for every k G [0, + 00 ) fl Q. 

Here below, we derive an analytic closed-form formula for /hp 98 (^). 
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• if fc £ [0,1) n Q, we consider the complexity exponent: oj{k, 1, 2) = uj{2, 1, k). 

The corresponding upper bound is provided in “Q, section 8.3, equation 8.2”, where e > 0 
is some small absolute constant: 


r 31 ^ 

/hP98(1) = i 2{l-a) + {l-k) + {cj-l){k-a) , 

I 1-a ' 


, if fc £ [0, a] 

, if A: £ (a, 1) 


• if fc = 1, we consider the complexity exponent a;(l, 1,2). 

The corresponding upper bound is provided in “Q, section 8.1, at line 13”: 

/hp98(1) = 3.334 


• if fc £ (1, 2) n Q, we consider the complexity exponent: 

uj{k, 1, 2) = w(2, k, 1) 

= kui{2/k, 1, 1/k) 


The corresponding upper bound is provided in “Q, section 8.3, equation (8.2)”, where e > 0 
is some small absolute constant: 


/HP98(fc) — k ■ 



(l-a) + (l 


~ j) + 

1 — a 



for every k £ (1, 2). 


• if fc = 2, we consider the complexity exponent: 

w(2,l,2) = 2cj(l,l,l/2). 

The corresponding upper bound is provided in “Q, section 8.2, equation (8.1)”: 

2(1 — r) + (r — a)oj 


/hp98(2) = 2 • 


1 — a 


= 4.2107878 

if fc £ (2, + 00 ) n Q, we consider the complexity exponent: 

uj{k, 1, 2) = uj{k, 2,1) 

= 2a;(fc/2,1,1/2) 

The corresponding upper bound is provided in “Q, section 8.2, equation (8.1)”: 


/HP98(fc) = 2 ■ 


^(1 — G;) + ^ + (w — 1)(5 — 01 ) 
1 — a 


+ e 


for every fc £ (2, + 00 ) 
and some small e > 0. 


In summary, gi can be dehned by the following formula, for every fc £ [0, + 00 ) and e > 0 is 
some small absolute constant: 


9l{k) = /HP98(fc) - fc = < 


3 — fc + e 

2(l-a) + (l-fc) + (t^-l)(fc-a) _h_i_ 
1 — a. ' 

2.334 

fc • 


|(l-a) + (l-.^) + (aj-l)(^-a) ^ 


1 —Q! 

2.2107878 

|(l-a) + i + (l^-l)(i-0!) 


2 -( 


+ e) - fc 


, if fc £ [0, a] 

, if fc £ (a, 1) 

, if fc = 1 
, if fc £ (1,2). 

, if fc = 2 
, if fc £ (2, +CX)) 
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The qualitative behaviour of gi is traced in Fig. [BJ with a filled blue colored line. 

At this point, observe that gi{k) is piecewise linear and that it attains its global minimum for 
fc = 2 , i.e., 

(7i(2) = min giik) = 2.2107878. 

fcG[0,+oo) 



k 


Figure 4: Plot of gi{k) for k £ [0, 5]. 


A.2 The upper bounds of Le Gall (2012) 

In a similar way, the second function g 2 is defined as follows: 

g 2 {k) = A:/LGi 2 (fc) - k, for every A: S {1} U [2, +oo). 

here, 52 takes into account the upper bound /lgi 2 (^) for a;(l, 1, 1/k), which was established by 
Le Gall in Q. We remark that, at the current state of the art, the upper bounds of Le Gall apply 
to oj{r, s, t) if and only if r = s. For this reason, when k £ (2, + 00 ) fl Q, we were able to apply 
Le Gall’s bounds on uj{k, 1,2) only by relying on the following upper bound: 

u}{k, 1,2) < ui{k, 1, k) (for every k >2) 

= k(jj{l, 1, 1/k) (by homogeneity) 

< fc/LG12(A:). 

In addition, we applied Le Gall’s bounds on n x nhy n x n? matrix products by considering the 
complexity exponent w(l, 1, 2), which is actually one of those explicitly studied by Le Gall in [^. 
Notice that, when k £ (1,2), it is not possible to apply the results of Le Gall to bound 
uj{k, 1, 2), because k ^ l,k ^ 2 and 1 7 ^ 2 so that the above mentioned condition (i.e., that r = s 
in tLi(r, s, t)) doesn’t apply in that case. This explains why g 2 ik) is defined on k £ {1} U [2, + 00 ). 
The qualitative behaviour, and many exact values, of /lgi2 were evaluated in [^, by solving a 
non-linear optimization problem with the computer program Maple (see “[^, Section 1, page 4, 
Table 1 and Figure 1”). Here above, in Fig. [5l we provide some data for g 2 {k). This allows 
us to show the qualitative behaviour of 52 , as it is traced in Fig. [3] with a dashed red colored 
line. In summary, the results in Q allow us to assert that ^ 2 ( 1 ) = 2.256689 > 2.093362 = ( 72 ( 2 ) 
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k 

92 

1 

2.256689 

0 . 5-1 

2.093362 

0 . 45-1 

2.2824489 

0 

1 

2.5304375 

0 . 35-1 

2.8653429 

0 . 34-1 

2.9463853 

0 . 33-1 

3.0331485 


Figure 5: Some values of 32 


and that g 2 {k) is monotone increasing in [2,+ 00 ). Concerning its global minimization, since 
^ 2 ( 1 ) = 2.256689 > 2.093362 = 52 ( 2 ) and since g 2 {k) is monotone increasing in [2,+ 00 ), the 
following holds: 

02 ( 2 ) = min g 2 ik) = 2.093362. 

fcG{l}U[2.+oo) 

The qualitative behaviour of 52 is traced in Fig.[6l with a dashed red colored line. The graphic 
shows that gi and g 2 perfectly agree on their argument of minimum value, which is fc = 2 . 

3.5 


3 


2.5 


2 

0 1 2 3 4 5 

k 

Figure 6: Plot of gi{k) for k G [0, 5], and of g 2 {k) for k G {1} U [2, + 00 ). 



Conclusion of Appendix A. In summary, both estimates and Q indicate that the mini¬ 
mum complexity comes at k = 2, namely, they both indicate that the optimal size of the batch 
of maximal cliques B is given by \B\ = v?. 


B Appendix B: A Reduction from to QSFI 

In this section it is shown how to reduce (i-e., the problem of generating all the maximal 
clique children C of any maximal clique parent P) to the following query problem named QSFI. 
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Problem: Query-Set-Family-Intersection (QSFI). 


They are fixed two positive integers m,n G N and a family of sets 
such that Sk C [n] for every k G [to]. 

Task: Fixed {m,n, {Sk}kG[m]), decide (possibly many) queries Q{P,k) of the 
following form: 

Q{P, k) = “is it true that P n 5^ 7 ^ 0 holds ?”, 

where P C [n] and k G [to]. 

The reduction to QSFI follows here below. 

Proposition 21 (Reduction from to QSFI). Let P be a maximal clique of any given n-vertex 
graph G = {V, E). For every i,j G [n], define the following sets Ai, Bj C [n]; 

Ai = n r(z) and Bj = r(j). 

Moreover, let m = and consider the set family {Sij}ij^[n] which is defined as follows: 

Si,j = Ai \ Bj, for every i,j G [n]. 

Now, consider an instance of QSFI in which (m, n, {<S'ij}ijg[„]) is fixed as we have just mentioned. 
Also recall that (i,j) is good with respect to P if and only if there exists u G P^i nr(i) such that 
{u,j} ^ E. Then, the following holds: 

{i,j) G [n] X [n] is good w.r.t. P -4=^ QiP, ihj)) is YES P n Sij 7 ^ 0. 

Proof. The proof is similar to that of Proposition[ 8 l To start with, observe that j G F is adjacent 
to all the vertices in P<i n r(i) if and only if (P<i fl r(i)) \ r(j) = 0. Equivalently, 

(i,j) G [n] X [n] is good w.r.t. P <;=> Pn ^(F<i nr(i)) \r(j)^ 7 ^ 0. 

Clearly, (F<j n r(i)) \ r(j) depends only on i,j and not on P, so that one can safely write this 
set as S'ij = Ai \ Bj = (F<i n r(i)) \ r(j). Thus, in order to assess whether {i,j) is good with 
respect to P, it is sufficient to check whether or not P n Sij 7 ^ 0 holds. □ 

In practice, in order to test the Q{P, k) queries, one could advantageously exploit some fast 
set intersection algorithms based on Bitwise-AND and SIMD instructions. However, it currently 
remains an open question to determine how these techniques compare in practice with some other 
well known algorithms for MCL such as the Bron-Kerbosch and derived algorithms. 
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